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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 
(i) APPLICANT: RABIN, Mark B. 

(ii) TITLE OF INVENTION: MUTATIONS IN THE BRCA1 

GENE 

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howrey & Simon 

(B) STREET: 1299 Pennsylvania Avenue, N.W. 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20004 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Mendelson, Elliot C 

(B) REGISTRATION NUMBER: P42,878 

(C) REFERENCE/ DOCKET NUMBER: 05371.0032.999 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-383-7073 

(B) TELEFAX: 202-383-6610 

(C) TELEX: 

(2) INFORMATION FOR SEQ ID NO:l: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5710 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 

5 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AGCTCGCTGA GACTTCCTGG ACCCCGCACC AGGCTGTGGG GTTTCTCAGA TAACTGGGCC 60 

~la*Sr CCTGCGCTCA GGAGGCCTTC ACCCTCTGCT CTGGGTAAAG TTCATTGGAA CAGAAAGAA^ 120 

*"' r 1TGGATTTATC TGCTCTTCGC GTTGAAGAAG TACAAAATGT CATTAATGCT AT G C AG AAAA^ 180 

10 TCTTAGAGTG TCCCATCTGT CTGGAGTTGA TCAAGGAACC TGTCTCCACA AAGTGTGACC 24 0 

Q ACATATTTTG CAAATTTTGC ATGCTGAAAC TTCTCAACCA GAAGAAAGGG CCTTCACAGT 300 

# GTCCTTTATG TAAGAATGAT ATAACCAAAA GGAGCCTACA AGAAAGTACG AGATTTAGTC 360 

y3 AACTTGTTGA AGAGCTATTG AAAATCATTT GTGCTTTTCA GCTTGACACA GGTTTGGAGT 4 20 

HI ATGCAAACAG CTATAATTTT GCAAAAAAGG AAAATAACTC TCCTGAACAT CTAAAAGATG 4 80 

If AAGTTTCTAT CATCCAAAGT ATGGGCTACA GAAACCGTGC CAAAAGACTT CTACAGAGTG 54 0 

m AACCCGAAAA TCCTTCCTTG CAGGAAACCA GTCTCAGTGT CCAACTCTCT AACCTTGGAA 600 

|Tj CTGTGAGAAC TCTGAGGACA AAGCAGCGGA TACAACCTCA AAAGACGTCT GTCTACATTG 660 

[fl AATTGGGATC TGATTCTTCT GAAGATACCG TTAATAAGGC AACTTATTGC AGTGTGGGAG 720 

J" ATCAAGAATT GTTACAAATC ACCCCTCAAG GAACCAGGGA TGAAATCAGT TTGGATTCTG 780 

|P CAAAAAAGGC TGCTTGTGAA TTTTCTGAGA CGGATGTAAC AAATACTGAA CATCATCAAC 8 40 

CCAGTAATAA TGATTTGAAC ACCACTGAGA AGCGTGCAGC TGAGAGGCAT CCAGAAAAGT 900 

jjjj ATCAGGGTAG TTCTGTTTCA AACTTGCATG TGGAGCCATG TGGCACAAAT ACTCATGCCA 960 

^ GCTCATTACA GCATGAGAAC AGCAGTTTAT TACTCACTAA AGACAGAATG AATGTAGAAA 1020 

LS AGGCTGAATT CTGTAATAAA AGCAAACAGC CTGGCTTAGC AAGGAGCCAA CATAACAGAT 1080 

|f GGGCTGGAAG TAAGGAAACA TGTAATGATA GGCGGACTCC CAGCACAGAA AAAAAGGTAG 1140 

H;: ATCTGAATGC TGATCCCCTG TGTGAGAGAA AAGAATGGAA TAAGCAGAAA CTGCCATGCT 1200 

CAGAGAATCC TAGAGATACT GAAGATGTTC CTTGGATAAC ACTAAATAGC AGCATTCAGA 12 60 

AAGTTAATGA GTGGTTTTCC AGAAGTGATG AACTGTTAGG TTCTGATGAC TCACATGATG 1320 

GGGAGTCTGA ATCAAATGCC AAAGTAGCTG ATGTATTGGA CGTTCTAAAT GAGGTAGATG 1380 

30 AATATTCTGG TTCTTCAGAG AAAATAGACT TACTGGCCAG TGATCCTCAT GAGGCTTTAA 14 4 0 

TATGTAAAAG TGAAAGAGTT CACTCCAAAT CAGTAGAGAG TAATATTGAA GACAAAATAT 1500 

TTGGGAAAAC CTATCGGAAG AAGGCAAGCC TCCCCAACTT AAGCCATGTA ACTGAAAATC 1560 

TAATTATAGG AGCATTTGTT ACTGAGCCAC AC?ATAATACA AGAGCGTCCC CTCACAAATA 1620 

AATTAAAGCG TAAAAGGAGA CCTACATCAG GCCTTCATCC TGAGGATTTT ATCAAGAAAG 1680 

35 CAGATTTGGC AGTTCAAAAG ACTCCTGAAA TGATAAATCA GGGAACTAAC CAAAC GGAGC 17 40 

AGAATGGTCA AGTGATGAAT ATTACTAATA GTGGTCATGA GAATAAAACA AAAGGTGATT 1800 

CTATTCAGAA TGAGAAAAAT CCTAACCCAA TAGAATCACT CGAAAAAGAA TCTGCTTTCA 18 60 

AAACGAAAGC TGAACCTATA AGCAGCAGTA TAAGCAATAT GGAACTCGAA TTAAATATCC 1920 

ACAATTCAAA AGCACCTAAA AAGAATAGGC TGAGGAGGAA GTCTTCTACC AGGCATATTC 1980 

40 ATGCGCTTGA ACTAGTAGTC AGTAGAAATC TAAGCCCACC TAATTGTACT GAATTGCAAA 204 0 

TTGATAGTTG TTCTAGCAGT GAAGAGATAA AGAAAAAAAA GTACAACCAA AT GCC AG TCA 2100 

GGCACAGCAG AAACCTACAA CTCATGGAAG GTAAAGAACC TGCAACTGGA GCCAAGAAGA 2160 

GTAACAAGCC AAATGAACAG ACAAGTAAAA G AC AT G AC AG TGATACTTTC CCAGAGCTGA 2220 

AGTTAACAAA TGCACCTGGT TCTTTTACTA AGTGTTCAAA TACCAGTGAA CTTAAAGAAT 2280 

45 TTGTCAATCC TAGCCTTCCA AGAGAAGAAA AAGAAGAGAA ACTAGAAACA GTTAAAGTGT 2340 
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CTAATAATGC 


TGAAGACCCC 


AAAGATCTCA 


TGTTAAGTGG 


AGAAAGGGTT 


TTGCAAACTG 


2400 




AAAGATCTGT 


AG AG AG TAG C 


AGTATTTCAC 


TGGTACCTGG 


TACTGATTAT 


GGCACTCAGG 


2460 




AAAGTATCTC 


GTTACTGGAA 


GTTAGCACTC 


TAGGGAAGGC 


AAAAACAGAA 


CCAAATAAAT 


2520 




GTGTGAGTCA 


GTGTGCAGCA 


TTTGAAAACC 


CCAAGGGACT 


AATTCATGGT 


TGTTCCAAAG 


2580 


5 


ATAATAGAAA 


TGACACAGAA 


GGCTTTAAGT 


ATCCATTGGG 


ACATGAAGTT 


AACCACAGTC 


2640 




GGGAAACAAG 


CATAGAAATG 


GAAGAAAGTG 


AACTTGATGC 


TCAGTATTTG 


CAGAATACAT 


2700 




TCAAGGTTTC 


AAAGCGCCAG 


TCATTTGCTC 


TGTTTTCAAA 


TCCAGGAAAT 


GCAGAAGAGG 


2760 




AATGTGCAAC 


ATTCTCTGCC 


CACTCTGGGT 


CCTTAAAGAA 


ACAAAGTCCA 


AAAGTCACTT 


2820 


10 


TTGAATGTGA 


ACAAAAGGAA 


GAAAATCAAG 


GAAAGAATGA 


GTCTAATATC 


AAGCCTGTAC 


2880 


AGACAGTTAA 


TATCACTGCA 


GGCTTTCCTG 


TGGTTGGTCA 


GAAAGATAAG 


CCAGTTGATA 


294 0 




ATGCCAAATG 


TAGTATCAAA 


GGAGGCTCTA 


GGTTTTGTCT 


ATCATCTCAG 


TTCAGAGf^rA 


3000 




ACGAAACTGG 


ACTCATTACT 


CCAAATAAAC 


ATGGACTTTT 


ACAAAACCCA 


TATCGTATAP 


1060 




CACCACTTTT 


TCCCATCAAG 


TCATTTGTTA 


AAACTAAATG 


TAAGAAAAAT 


CTGCTAGAGG 


3120 




AAAACTTTGA 


GGAACATTCA 


ATGTCACCTG 


AAAGAGAAAT 


GGGAAATGAG 


AACATTCCAA 




is 


GTACAGTGAG 


CACAATTAGC 


CGTAATAACA 


TTAGAGAAAA 


TGTTTTTAAA 


GGAGCrAGPT 


3240 




CAAGCAATAT 


TAATGAAGTA 


GGTTCCAGTA 


CTAATGAAGT 


GGGCTCCAGT 


ATTAATGAAA 

n x x nn x \jj.nx ixi 


3300 


fi l 


TAGGTTCCAG 


TGATGAAAAC 


ATTCAAGCAG 


AACTAGGTAG 


AAACAGAGGG 


CCAAAATTGA 


3360 




ATGCTATGCT 


TAGATTAGGG 


GTTTTGCAAC 


CTGAGGTCTA 


TAAACAAAGT 


CTTCCTGGAA 


3420 




GTAATTGTAA 


GCATCCTGAA 


ATAAAAAAGC 


AAGAATATGA 


AGAAGTAGTT 


CAGACTGTTA 


3480 


ib 


ATACAGATTT 


CTCTCCATAT 


CTGATTTCAG 


ATAACTTAGA 


ACAGCCTATG 


GGAAGTAGTC 


3540 




ATGCATCTCA 


GGTTTGTTCT 


GAGACACCTG 


ATGACCTGTT 


AGATGATGGT 


GAAATAAAGG 


3600 




AAGATACTAG 


TTTTGCTGAA 


AATGACATTA 


AGGAAAGTTC 


TGCTGTTTTT 


AGCAAAAGCG 


3660 




TCCAGAGAGG 


AGAGCTTAGC 


AGGAGTCCTA 


GCCCTTTCAC 


CCATACACAT 


TTGGCTCAGG 

X X WWW X WilW w 


3720 




GTTACCGAAG 


AGGGGCCAAG 


AAATTAGAGT 


CCTCAGAAGA 


GAACTTATCT 


AGTGAGGATG 


3780 




AAGAGCTTCC 


CTGCTTCCAA 


CACTTGTTAT 


TTGGTAAAGT 


AAACAATATA 


CCTTCTCAGT 


3840 




CTACTAGGCA 


TAGCACCGTT 


GCTACCGAGT 


GTCTGTCTAA 


GAACACAGAG 


GAGAATTTAT 

vnwjn^i xxx i i x 


3900 




TATCATTGAA 


GAATAGCTTA 


AATGACTGCA 


GTAACCAGGT 


AATATTGGCA 


AAGGCATCTC 


3960 




AGGAACATCA 


CCTTAGTGAG 


GAAACAAAAT 


GTTCTGCTAG 


CTTGTTTTCT 


TCACAGTGCA 


4020 




GTGAATTGGA 


AGACTTGACT 


GCAAATACAA 


ACACCCAGGA 


TCCTTTCTTG 


ATTGGTTCTT 

£i X X W WJ X X W» X X 


4080 


30 


CCAAACAAAT 


GAGGCATCAG 


TCTGAAAGCC 


AGGGAGTTGG 


TCTGAGTGAC 


AAGGAATTGG 


4140 




TTTCAGATGA 


TGAAGAAAGA 


GGAAC GGGCT 


TGGAAGAAAA 


TAATCAAGAA 


GAGCAAAGCA 


4200 




TGGATTCAAA 


CTTAGGTGAA 


GCAGCATCTG 


GGTGTGAGAG 


TGAAACAAGC 


GTCTCTGAAG 


4260 




ACTGCTCAGG 


GCTATCCTCT 


CAGAGTGACA 


TTTTAACCAC 


TCAGCAGAGG 


GATACCATGC 

Wi 1 X £ *V X Vw* w* 


4320 




AACATAACCT 


GATAAAGCTC 


CAGCAGGAAA 


TGGCTGAACT 


AGAAGCTGTG 


TTAGAACAGC 


4380 


35 


ATGGGAGCCA 


GCCTTCTAAC 


AGCTACCCTT 


cCatcataag 


TGACTCCTCT 

X W± lw X \^ X X 


GCCCTTGAGG 


4440 




ACCTGCGAAA 


TCCAGAACAA 


AGCACATCAG 


AAAAAGCAGT 


ATTAACTTCA 


CAGAAAAGTA 


4500 




GTGAATACCC 


TATAAGCCAG 


AATCCAGAAG 


GCCTTTCTGC 


TGACAAGTTT 

x wii^ruiVpj xxx 


GAGGTGTCTG 


4 5 60 




CAGATAGTTC 


TACCAGTAAA 


AATAAAGAAC 


CAGGAGTGGA 


AAGGTCATCC 


CCTTCTAAAT 


4620 




GCCCATCATT 


AGATGATAGG 


TGGTACATGC 


ACAGTTGCTC 


TGGGAGTCTT 


CAGAATAGAA 


4680 


40 


ACTACCCATC 


TCAAGAGGAG 


CTCATTAAGG 


TTGTTGATGT 


GGAGGAGCAA 


CAGCTGGAAG 


4740 




AGTCTGGGCC 


ACACGATTTG 


ACGGAAACAT 


CTTACTTGCC 


AAGGCAAGAT 


CTAGAGGGAA 


4800 




CCCCTTACCT 


GGAATCTGGA 


ATCAGCCTCT 


TCTCTGATGA 


CCCTGAATCT 


GATCCTTCTG 


4860 




AAGACAGAGC 


CCCAGAGTCA 


GCTCGTGTTG 


GCAACATACC 


ATCTTCAACC 


TCTGCATTGA 


4920 




AAGTTCCCCA 


ATTGAAAGTT 


GCAGAATCTG 


CCCAGGGTCC 


AGCTGCTGCT 


C AT ACT ACT G 


4980 


45 


ATACTGCTGG 


GTATAATGCA 


ATGGAAGAAA 


GTGTGAGCAG 


GGAGAAGCCA 


GAATTGACAG 


5040 




CTTCAACAGA 


AAGGGTCAAC 


AAAAGAATGT 


CCATGGTGGT 


GTCTGGCCTG 


ACCCCAGAAG 


5100 




AATTTATGCT 


CGTGTACAAG 


TTTGCCAGAA 


AACACCACAT 


CACTTTAACT 


AATCTAATTA 


5160 
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CTGAAGAGAC TACTCATGTT GTTATGAAAA CAGATGCTGA GTTTGTGTGT GAACGGACAC 5220 

TGAAATATTT TCTAGGAATT GCGGGAGGAA AATGGGTAGT TAGCTATTTC TGGGTGACCC 5280 

AGTCTATTAA AGAAAGAAAA AT GC 7 GAATG AGCATGATTT TGAAGTCAGA GGAGATGTGG 5 34 0 

TCAATGGAAG AAACCACCAA GGTCCAAAGC GAGCAAGAGA ATCCCAGGAC AGAAAGATCT 54 00 

5 TCAGGGGGCT AGAAATCTGT TGCTATGGGC CCTTCACCAA CATGGCCACA GATCAACTGG 54 60 

AATGGATGGT ACAGCTGTGT GGTGCTTCTG TGGTGAAGGA GCTTTCATCA TTCACCCTTG 5520 

GCACAGGTGT CCACCCAATT GTGGTTGTGC AGCCAGATGC CTGGACAGAG GACAATGGCT 5580 

TCCATGCAAT TGGGCAGATG TGTGAGGCAC CTGTGGTGAC CCGAGAGTGG GTGTTGGACA 5 64 0 

GTGTAGCACT CTACCAGTGC CAGGAGCTGG ACACCTACCT GATACCCCAG ATCCCCCACA 57 00 

10 GCCACTACTG A 57 10 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
|f (A) LENGTH: 18 63 amino acids 

(B) TYPE: amino acid 
j|5 (C) STRANDEDNESS: single 

*fi (D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: protein 

H (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

fU Met Asp Leu Ser Ala Leu Arg Val Glu Glu Val Gin Asn Val He Asn 

ip 1 5 10 15 

p Ala Met Gin Lys He Leu Glu Cys Pro He Cys Leu Glu Leu He Lys 

H 20 25 30 

Glu Pro Val Ser Thr Lys Cys Asp His He Phe Cys Lys Phe Cys Met 
35 40 45 

25 Leu Lys Leu Leu Asn Gin ,Lys Lys Gly Pro Ser Gin Cys Pro Leu Cys 

50 55 60 

Lys Asn Asp He Thr Lys Arg Ser Leu Gin Glu Ser Thr Arg Phe Ser 
65 70 75 80 

Gin Leu Val Glu Glu Leu Leu Lys He He Cys Ala Phe Gin Leu Asp 
30 85 * 90 95 

Thr Gly Leu Glu Tyr Ala Asn Ser Tyr Asn Phe Ala Lys Lys Glu Asn 

100 105 110 

Asn Ser Pro Glu His Leu Lys Asp Glu Val Ser He He Gin Ser Met 
115 120 125 

35 Gly Tyr Arg Asn Arg Ala Lys Arg Leu Leu Gin Ser Glu Pro Glu Asn 

130 135 140 

Pro Ser Leu Gin Glu Thr Ser Leu Ser Val Gin Leu Ser Asn Leu Gly 
145 150 155 160 

Thr Val Arg Thr Leu Arg Thr Lys Gin Arg He Gin Pro Gin Lys Thr 
40 165 170 175 

Ser Val Tyr He Glu Leu Gly Ser Asp Ser Ser Glu Asp Thr Val Asn 
180 185 190 
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10 



so 



30 



35 



40 



45 



Lys Ala Thr Tyr Cys Ser Val Gly Asp Gin Glu Leu Leu Gin He Thr 

195 200 205 

Pro Gin Gly Thr Arg Asp Glu He Ser Leu Asp Ser Ala Lys Lys Ala 

210 215 220 

Ala Cys Glu Phe Ser Glu Thr Asp Val Thr Asn Thr Glu His His Gin 
225 230 235 240 

Pro Ser Asn Asn Asp Leu Asn Thr Thr Glu Lys Arg Ala Ala Glu Arg 

245 250 255 

His Pro Glu Lys Tyr Gin Gly Ser Ser Val Ser Asn Leu His Val Glu 

260 265 270 

Pro Cys Gly Thr Asn Thr His Ala Ser Ser Leu Gin His Glu Asn Ser 

275 280 285 

Ser Leu Leu Leu Thr Lys Asp Arg Met Asn Val Glu Lys Ala Glu Phe 

290 295 300 

Cys Asn Lys Ser Lys Gin Pro Gly Leu Ala Arg Ser Gin His Asn Arg 
305 310 315 320 

Trp Ala Gly Ser Lys Glu Thr Cys Asn Asp Arg Arg Thr Pro Ser Thr 

325 330 335 

Glu Lys Lys Val Asp Leu Asn Ala Asp Pro Leu Cys Glu Arg Lys Glu 

340 345 350 

Trp Asn Lys Gin Lys Leu Pro Cys Ser Glu Asn Pro Arg Asp Thr Glu 

355 360 365 

Asp Val Pro Trp He Thr Leu Asn Ser Ser He Gin Lys Val Asn Glu 

370 375 380 

Trp Phe Ser Arg Ser Asp Glu Leu Leu Gly Ser Asp Asp Ser His Asp 
385 390 395 400 

Gly Glu Ser Glu Ser Asn Ala Lys Val Ala Asp Val Leu Asp Val Leu 

405 410 415 

Asn Glu Val Asp Glu Tyr Ser Gly Ser Ser Glu Lys He Asp Leu Leu 

420 425 430 

Ala Ser Asp Pro His Glu Ala Leu He Cys Lys Ser Glu Arg Val His 

435 440 445 

Ser Lys Ser Val Glu Ser Asn He Glu Asp Lys He Phe Gly Lys Thr 

450 455 460 

Tyr Arg Lys Lys Ala Ser Leu Pro Asn Leu Ser His Val Thr Glu Asn 
465 470 475 480 

Leu He He Gly Ala Phe Val Thr Glu Pro Gin He He Gin Glu Arg 

485 490 495 

Pro Leu Thr Asn Lys Leu Lys Arg Lys Arg Arg Pro Thr Ser Gly Leu 

500 505 510 

His Pro Glu Asp Phe He Lys Lys Ala Asp Leu Ala Val Gin Lys Thr 

515 520 525 

Pro Glu Met He Asn Gin Gly Thr Asn Gin Thr Glu Gin Asn Gly Gin 

530 535 540 

Val Met Asn He Thr Asn Ser Gly His Glu Asn Lys Thr Lys Gly Asp 
545 550 555 560 

Ser He Gin Asn Glu Lys Asn Pro Asn Pro He Glu Ser Leu Glu Lys 
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565 570 575 

Glu Ser Ala Phe Lys Thr Lys Ala Glu Pro He Ser Ser Ser He Ser 

530 585 590 

Asa Met Glu Leu Glu Leu Asn He His Asn Ser Lys Ala Pro Lys Lys 
5 595 600 605 

Asn Arg Leu Arg Arg Lys Ser Ser Thr Arg His He His Ala Leu Glu 

610 615 620 

Leu Val Val Ser Arg Asn Leu Ser Pro Pro Asn Cys Thr Glu Leu Gin 
625 630 635 640 

10 lie Asp Ser Cys Ser Ser Ser Glu Glu He Lys Lys Lys Lys Tyr Asn 

645 650 655 

^ Gin Met Pro Val Arg His Ser Arg Asn Leu Gin Leu Met Glu Gly Lys 

O 660 665 670 

# Glu Pro Ala Thr Gly Ala Lys Lys Ser Asn Lys Pro Asn Glu Gin Thr 

fS 675 680 685 

y Ser Lys Arg His Asp Ser Asp Thr Phe Pro Glu Leu Lys Leu Thr Asn 

[If 690 695 700 

El Ala Pro Gly Ser Phe Thr Lys Cys Ser Asn Thr Ser Glu Leu Lys Glu 

lij 705 710 715 720 

if Phe Val Asn Pro Ser Leu Pro Arg Glu Glu Lys Glu Glu Lys Leu Glu 

3 725 730 735 

yk Thr Val Lys Val Ser Asn Asn Ala Glu Asp Pro Lys Asp Leu Met Leu 

p 740 745 750 

kj Ser Gly Glu Arg Val Leu Gin Thr Glu Arg Ser Val Glu Ser Ser Ser 

fff 755 760 765 

p He Ser Leu Val Pro Gly Thr Asp Tyr Gly Thr Gin Glu Ser He Ser 

11 770 775 780 

r Leu Leu Glu Val Ser Thr Leu Gly Lys Ala Lys Thr Glu Pro Asn Lys 

785 790 795 800 

30 Cys Val Ser Gin Cys Ala Ala Phe Glu Asn Pro Lys Gly Leu He His 

805 810 815 

Gly Cys Ser Lys Asp Asn Arg Asn Asp Thr Glu Gly Phe Lys Tyr Pro 

820 825 830 

Leu Gly His Glu Val Asn His Ser Arg Glu Thr Ser He Glu Met Glu 
35 835 840 * 845 

Glu Ser Glu Leu Asp Ala Gin Tyr Leu Gin Asn Thr Phe Lys Val Ser 

850 855 860 

Lys Arg Gin Ser Phe Ala Leu Phe Ser Asn Pro Gly Asn Ala Glu Glu 
365 870 875 880 

40 Glu Cys Ala Thr Phe Ser Ala His Ser Gly Ser Leu Lys Lys Gin Ser 

885 890 895 

Pro Lys Val Thr Phe Glu Cys Glu Gin Lys Glu Glu Asn Gin Gly Lys 

900 905 910 

Asn Glu Ser Asn He Lys Pro Val Gin Thr Val Asn He Thr Ala Gly 
45 915 920 925 

Phe Pro Val Val Gly Gin Lys Asp Lys Pro Val Asp Asn Ala Lys Cys 
930 935 940 
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Ser 


He 


Lys 


Glv 


Gly 


Ser 


Arg 


Phe 


Cys 


Leu 


Ser 


Ser 


Gin 


Phe 


Arg Gly 




945 










950 










955 










960 


• 


Asn 


Glu 


Thr 


Glv 


Leu 
965 


He 


Thr 


Pro 


Asn 


Lys 
970 


His 


Gly 


Leu 


Leu 


Gin 
975 


Asn 


5 


Pro 


Tyr 


Arg 


He 
980 


Pro 


Pro 


Leu 


Phe 


Pro 
985 


He 


Lys 


Ser 


Phe 


Val 
990 


Lys 


Thr 




Lys 


Cys 


Lys 


Lys 


Asn 


Leu 


Leu 


Glu 


Glu 


Asn 


Phe 


Glu 


Glu 


His 


Ser 


Met 








995 










1000 








1005 








Ser 


Pro 


Glu 


Arg 


Glu 


Met 


Gly Asn 


Glu 


Asn 


He 


Pro 


Ser 


Thr 


Val 


Ser 


10 




1010 








1015 








1020 










Thr 


He 


Ser 


Arg 


Asn 


Asn 


He 


Arg 


Glu 


Asn 


Val 


Phe 


Lys 


Gly 


Ala 


Ser 




1025 








1030 








1035 








104 




Ser 


Ser 


Asn 


He 


Asn 


Glu 


Val 


Gly 


Ser 


Ser 


Thr 


Asn 


Glu 


Val 


Gly 


Ser 












1045 








1050 








1055 


% 


Ser 


He 


Asn 


Glu 


He 


Gly 


Ser 


Ser 


Asp 


Glu 


Asn 


He 


Gin 


Ala 


Glu 


Leu 


B I 








1060 








1065 








1070 






Gly Arg Asn 


Arg 


Gly 


Pro 


Lys 


Leu 


Asn 


Ala 


Met 


Leu 


Arg 


Leu 


Gly Val 








1075 








1080 








1085 








Leu 


Gin 


Pro 


Glu 


Val 


Tyr 


Lys 


Gin 


Ser 


Leu 


Pro 


Gly 


Ser 


Asn 


Cys 


Lys 


m 




1090 








1095 








1100 










His 


Pro 


Glu 


lie 


Lys 


Lys 


Gin 


Glu 


Tyr 


Glu 


Glu 


Val 


Val 


Gin 


Thr 


Val 


Li 


1105 








1110 








1115 








112 




Asn 


Thr 


Asp 


Phe 


Ser 


Pro 


Tyr 


Leu 


He 


Ser 


Asp 


Asn 


Leu 


Glu 


Gin 


Pro 












1125 








1130 








1135 




Met 


Gly 


Ser 


Ser 


His 


Ala 


Ser 


Gin 


Val 


Cys 


Ser 


Glu 


Thr 


Pro Asp Asp 










1140 








1145 








1150 






Leu 


Leu 


Asp Asp Gly Glu 


He 


Lys 


Glu 


Asp 


Thr 


Ser 


Phe 


Ala 


Glu 


Asn 








1155 








1160 








1165 






30 


Asp 


He 


Lys 


Glu 


Ser 


Ser 


Ala 


Val 


Phe 


Ser 


Lys 


Ser 


Val 


Gin 


Arg 


Gly 




1170 








1175 








1180 










Glu 


Leu 


Ser 


Arg 


Ser 


Pro 


Ser 


Pro 


Phe 


Thr 


His 


Thr 


His 


Leu 


Ala 


Gin 




1185 










1190 








1195 








120 




Gly 


Tyr 


Arg 


Arg 


Gly Ala 


Lys 


Lys 


Leu 


Glu 


Ser 


Ser 


Glu 


Glu 


Asn 


Leu 


35 










1205 








1210 








1215 


Ser 


Ser 


Glu 


Asp 


Glu 


Glu 


Leu 


Pro 


Cys* Phe Gin 


His 


Leu 


Leu 


Phe 


Gly 










1220 








1225 








1230 






Lys 


Val 


Asn 


Asn 


He 


Pro 


Ser 


Gin 


Ser 


Thr 


Arg 


His 


Ser 


Thr 


Val 


Ala 








1235 








1240 








1245 








Thr 


Glu 


Cys 


Leu 


Ser 


Lys 


Asn 


Thr 


Glu 


Glu 


Asn 


Leu 


Leu 


Ser 


Leu 


Lys 


40 




1250 








1255 








1260 










Asn 


Ser 


Leu 


Asn 


Asp 


Cys 


Ser 


Asn 


Gin 


Val 


He 


Leu 


Ala 


Lys 


Ala 


Ser 




1265 










1270 








1275 








128 




Gin 


Glu 


His 


His 


Leu 


Ser 


Glu 


Glu 


Thr 


Lys 


Cys 


Ser 


Ala 


Ser 


Leu 


Phe 












1285 








1290 








1295 


45 


Ser 


Ser 


Gin 


Cys 


Ser 


Glu 


Leu 


Glu Asp 


Leu 


Thr 


Ala 


Asn 


Thr 


Asn 


Thr 










1300 










1305 










1310 






Gin Asp 


Pro 


Phe 


Leu 


He 


Gly 


Ser 


Ser 


Lys 


Gin 


Met 


Arg 
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1315 1320 1325 

Glu Ser Gin Gly Val Gly Leu Ser Asp Lys Glu Leu Val Ser Asp Asp 

1330 1335 1340 

Glu Glu Arg Gly Thr Gly Leu Glu Glu Asn Asn Gin Glu Glu Gin Ser 
5 1345 1350 1355 136 

Met Asp Ser Asn Leu Gly Glu Ala Ala Ser Gly Cys Glu Ser Glu Thr 

1365 1370 1375 

Ser Val Ser Glu Asp Cys Ser Gly Leu Ser Ser Gin Ser Asp He Leu 
1380 1385 1390 

10 Thr Thr Gin Gin Arg Asp Thr Met Gin His Asn Leu He Lys Leu Gin 

1395 1400 1405 

p Gin Glu Met Ala Glu Leu Glu Ala Val Leu Glu Gin His Gly Ser Gin 

yg 1410 1415 1420 

^ Pro Ser Asn Ser Tyr Pro Ser He He Ser Asp Ser Ser Ala Leu Glu 

1425 1430 1435 144 

Asp Leu Arg Asn Pro Glu Gin Ser Thr Ser Glu Lys Ala Val Leu Thr 
J| 1445 1450 1455 

7Z Ser Gin Lys Ser Ser Glu Tyr Pro He Ser Gin Asn Pro Glu Gly Leu 

•Jlf 1460 1465 1470 

20 Ser Ala Asp Lys Phe Glu Val Ser Ala Asp Ser Ser Thr Ser Lys Asn 

J\ 1475 1480 1485 

J" Lys Glu Pro Gly Val Glu Arg Ser Ser Pro Ser Lys Cys Pro Ser Leu 

O 1490 1495 1500 

i 5 4 Asp Asp Arg Trp Tyr Met His Ser Cys Ser Gly Ser Leu Gin Asn Arg 

115 1505 1510 1515 152 

O Asn Tyr Pro Ser Gin Glu Glu Leu He Lys Val Val Asp Val Glu Glu 

H 1525 1530 1535 

Gin Gin Leu Glu Glu Ser Gly Pro His Asp Leu Thr Glu Thr Ser Tyr 
1540 1545 1550 

30 Leu Pro Arg Gin Asp Leu Glu Gly Thr Pro Tyr Leu Glu Ser Gly He 

1555 1560 1565 

Ser Leu Phe Ser Asp Asp Pro Glu Ser Asp Pro Ser Glu Asp Arg Ala 

1570 1575 1580 

Pro Glu Ser Ala Arg Val Gly Asn He Pro Ser Ser Thr Ser Ala Leu 
35 1585 1590 * 1595 160 

Lys Val Pro Gin Leu Lys Val Ala Glu Ser Ala Gin Gly Pro Ala Ala 

1605 1610 1615 

Ala His Thr Thr Asp Thr Ala Gly Tyr Asn Ala Met Glu Glu Ser Val 
1620 1625 1630 

40 Ser Arg Glu Lys Pro Glu Leu Thr Ala Ser Thr Glu Arg Val Asn Lys 

1635 1640 1645 

Arg Met Ser Met Val Val Ser Gly Leu Thr Pro Glu Glu Phe Met Leu 

1650 1655 1660 

Val Tyr Lys Phe Ala Arg Lys His His He Thr Leu Thr Asn Leu He 
45 1665 1670 1675 168 

Thr Glu Glu Thr Thr His Val Val Met Lys Thr Asp Ala Glu Phe Val 
1685 1690 1695 
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Cys Glu Arg Thr Leu Lys Tyr Phe Leu Gly He Ala Gly Gly Lys Trp 

1700 1705 1710 

Val Val Ser Tyr Phe Trp Val Thr Gin Ser He Lys Glu Arg Lys Met 

.1715 1720 1725 

Leu Asn Glu His Asp Phe Glu Val Arg Gly Asp Val Val Asn Gly Arg 

1730 1735 1740 

Asn His Gin Gly Pro Lys Arg Ala Arg Glu Ser Gin Asp Arg Lys He 
1745 1750 1755 176 

Phe Arg Gly Leu Glu He Cys Cys Tyr Gly Pro Phe Thr Asn Met Pro 

1765 1770 1775 

Thr Asp Gin Leu Glu Trp Met Val Gin Leu Cys Gly Ala Ser Val Val 

1780 1785 1790 

Lys Glu Leu Ser Ser Phe Thr Leu Gly Thr Gly Val His Pro He Val 

1795 1800 1805 

Val Val Gin Pro Asp Ala Trp Thr Glu Asp Asn Gly Phe His Ala He 

1810 1815 1820 

Gly Gin Met Cys Glu Ala Pro Val Val Thr Arg Glu Trp Val Leu Asp 
1825 1830 1835 184 

Ser Val Ala Leu Tyr Gin Cys Gin Glu Leu Asp Thr Tyr Leu He Pro 

1845 1850 1855 

Gin He Pro His Ser His Tyr 
1860 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AATCTTAGAG TGTCCCA 17 
(2} INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
ATCTTAGTGT CCCACCT 17 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
CAGAAAAAAA GGTAGAT 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CAGAAAAAAA AGGTAGA 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
AGAGAATCCC AGGACAG 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AGAGAATCCC CAGGACA 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AGGACCTGCG AAATCCA !7 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AGGACCTGTG AAATCCA 17 
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